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Abstract 

Experiments are once again under way at the LHC. This time around, however, the mood in 
the high-energy physics community is pessimistic. There is a growing suspicion that naturalness 
arguments that predict new physics near the weak scale are faulty and that prospects for a new 
discovery are limited. We argue that such doubts originate from a misunderstanding of the 
foundations of naturalness arguments. In spite of the hrst run at the LHC, which aggravated the 
little-hierarchy problem, there is no cause for doubting naturalness or natural theories. Naturalness 
is grounded in Bayesian probability logic — it is not a scientihc theory and it makes no sense to 
claim that it could be falsihed or that it is under pressure from experimental data. We should 
remain optimistic about discovery prospects; natural theories, such as supersymmetry, generally 
predict new physics close to the weak scale. Furthermore, from a Bayesian perspective, we briefly 
discuss ’t Hooft’s technical naturalness and a contentious claim that the little-hierarchy problem 
hints that the Standard Model is a fundamental theory. 
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I. INTRODUCTION 


This summer, after a two-year hiatus, collisions resumed at the LHC at a centre-of- 
mass energy of ^/s = 13TeV. Unlike in the hrst run, the mood in the high-energy physics 
community is gloomy. The optimism that characterized the hrst run (see e.g.. Ref. [1, 2]) was 
dampened by evidence that there is no new physics near the weak scale and the suspicion 
that faith in the principle of naturalness was misplaced [3, 4]. This gloom is not universal; 
there is a rift in the community and a few remain upbeat [5, 6]. 

The principle of naturalness emerged in high-energy physics in the late 1970s when 
Weinberg [7, 8], Susskind [9] and Gildener [10], amongst others, identihed a “naturalness” 
problem concerning the mass of a fundamental scalar held. The natural scale for such a 
mass in an effective theory is that at which microscopic physics is important, the cut-off 
scale, because of quadratic quantum corrections. In the Standard Model (SM), the mass of a 
complex scalar held determines the weak scale [11-13]. Thus, the cut-off scale ought to be 
close to the weak scale, or else the SM must be hne-tuned. However, without new physics, 
we expect the cut-off scale in the SM to be around the Planck scale. This dichotomy became 
known as the “hierarchy problem” in reference to the hierarchy between the weak scale and 
the Planck scale. 

As an example, consider the physical mass of a scalar held in an effective scalar-held 
theory. The physical mass is the pole in the two-point function; diagrammatically. 



The loop results in a quadratic correction to the physical mass: 

"^Phys = + . . . (1) 

If the physical mass is to be much smaller than the cut-off, , we require miraculous 

cancellations between the bare mass and the quadratic correction. That this is a problem 
hinges upon a notion of “naturalness” — a theory is natural if its generic prediction for the 
weak scale is correct, and hne-tuned otherwise. 

In the ensuing decades, the hierarchy problem shaped thinking in theoretical physics and 
precipitated phenomenological interest in so-called natural theories. Popular natural theories 
included supersymmetric theories [14-16], in which quadratic corrections to scalar masses 
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vanish [17]; technicolor theories [7-9, 18-20], in which there are no fnndamental scalars at 
the weak scale; and large extra dimensions [21, 22], in which the Planck scale is close to the 
weak scaled 

However, in the years following LEP experiments (see e.g.. Ref. [24, 25]), beginning around 
the year 2000, a “little-hierarchy problem” emerged (see e.g.. Ref. [26]). Since LEP forbade 
new physics close to the weak scale, there must be a little hierarchy between the weak scale 
and the scale of new physics, and the weak scale might require “fine-tuning” such that it is 
less than the scale of new physics. This problem was exacerbated in the last few years by 
the hrst run of LHC experiments (see e.g.. Ref. [27, 28]). The discovery of a Higgs boson 
in 2012 [29, 30] suggested that there is a fundamental scalar near the weak scale, but the 
absence of new physics suggested that the cut-off of the SM is not near the weak scale. As a 
result, there is increasing suspicion that naturalness was a flawed criteria (see e.g.. Ref. [31]). 

Indeed, there are at least two common responses to the little-hierarchy problem; 

1. The SM is not an effective theory; quadratic divergences are an unphysical artefact of 
regularization. With modifications to explain gravity, dark matter and remedy Landau 
poles, it describes arbitrarily microscopic scales without enormous radiative corrections 
to the weak scale. 

2. The SM is not a natural theory. This is not a problem; naturalness is an aesthetic 
principle — an unreliable, prejudiced criteria on which to build knowledge, which was 
falsified by collider experiments, as demonstrated by the little-hierarchy problem. 

These responses are often combined [32-48]. It is the second response that I wish to dispel. 
By following Jaynes’ Bayesian “logic of science” [49] — and casting the hierarchy problem in 
Bayesian language — it is apparent that the little-hierarchy problem is a little problem, and 
not a cause for doubting naturalness or natural theories. In passing, I will remark, somewhat 
unfavourably, upon the first response from a Bayesian perspective. 

II. BAYESIAN PROBABILITY LOGIC 

Bayesian probability extends absolute truth and falsehood by permitting a numerical 
measure of degree of belief. We assign numerical measures of our degrees of belief — 
^ Kaplan et al recently presented a novel class of natural theories — relaxion theories [23] — in which a 
back-reaction to electroweak symmetry breaking enforces a small weak scale. 
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“probabilities” — to scientific theories. Although we cannot verify a theory with data, by 
availing ourselves of Bayes’ theorem, we may calculate that a theory is more probable than a 
rival theory in light of data. 

Probability logic is by now textbook material; we follow Jaynes [49] and Gregory [50]. The 
aim is to construct a unique system of probability logic that follows from modest desiderata: 


1. Our degree of belief in a proposition can be represented by a single real number. 

2. Although our probability logic shall be quantitative, it must qualitatively agree with 
common sense. 

3. Our probability logic shall be consistent in that we require that 

(a) Every possible approach to a calculation must lead to an identical result. 

(b) All relevant evidence must be considered. We cannot, by hat, omit relevant 
information. 

(c) Every equivalent state of knowledge must lead to an identical result. 


These desiderata entail a unique system of logical operations including products and sums. 
The proofs are tedious; they consist of compiling exhaustive lists of possible forms for an 
operation, then rejecting possible forms one by one with the desiderata. Bayes’ theorem. 


p{A\B) 


p{B \ A) X p{A) 

p{B) 


( 2 ) 


is a result of probability logic. The signihcance of Bayes’ theorem is that it justihes weak 
inductive syllogisms such as; 


• If A implies B and B, then A more probable, and 

• If A implies B and not A, then B less probable, 
as well as strong deductive syllogisms. 

We must admit, however, that the character of our probability — it is assigned to any 
proposition or scientihc theory — is not in accord with the conventional frequentist picture 
of probability or the conventional methodology of science. Indeed, Popper, amongst others, 
offered brief scathing criticisms of Bayesian probability in science [51]; 
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/ do not believe that it is possible to construct a concept of the probability of 
hypotheses ... nothing is gained by replacing the word “true” by the word “probable”, 
and the word “false” by the word “improbable”. 

This scorn was unwarranted. Bayesian probability rehabilitates inductive reasoning, arguably 
solving Hume’s problem of induction [52], and Barman [53] notes that Bayesianism is now 

... the name stitched to the Jolly Roger of a leading school of statistics and 
what is arguably the leading view amongst philosophers of science concerning the 
confirmation of scientific hypotheses and scientific inference in general. 


Without inductive reasoning, a theory is either falsified and thus abandoned or unfalsified 
and thus viable but nothing more (see e.g.. Ref. [54]). For example, the SM Higgs boson was 
unfalsified prior to its discovery and unfalsified after its discovery — without induction, the 
discovery at the LHC cannot alter our belief in the SM Higgs boson. 

We will apply Bayes’ theorem in the form 


Posterior 

^if\F) 


Evidence 


p{D\T) 

p{D) 


Normalization 


Prior 

X J(T). 


( 3 ) 


The posterior is our degree of belief in a theory, T, in light of the experimental data, D — we 
judge a theory with our posterior belief. The posterior is the product of the evidence — the 
probability of obtaining our data assuming the theory — and our prior belief in the theory. 

The prior is a controversial yet critical ingredient. If we supply our belief in a theory 
prior to an experiment, our belief after the experiment is dictated by Bayes’ theorem. Bayes’ 
theorem cannot, however, dictate our prior beliefs. In keeping with our desiderata that 
identical states of knowledge ought to lead to identical results, priors ought to reflect our 
state of knowledge or ignorance rather than our subjective belief. In other words, I advocate 
objective priors. Although we might, in a qualitative manner, restrict permissible priors, 
there might not be a unique prior that qualitatively reflects our state of knowledge. If 
our prior beliefs are personal opinions, we cannot deliver objective conclusions, and Bayes’ 
theorem may be inappropriate for science. After all, we want science to deliver objective 
truths about the world. However, we could reach intersubjective agreement in light of data 
despite variation in prior beliefs. Indeed, Barman [53] argues that; 
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It is a fact of life that scientists start with different opinions. To try to quash this 
fact is to miss the essence of scientific objectivity: the emergence of an evidence 
driven consensus from widely differing initial conditions. 

Indeed, in many cases the correspondence between a prior and our knowledge may be fuzzy 
and we must take solace in the fact that priors are “washed out.” This is arguably a reflection 
of scientific practice. 


III. BAYESIAN MODEL COMPARISON 


The Bayesian approach is that to compare two theories, we ought to calculate the ratio of 
their probabilities conditioned on all relevant experimental data. 


PjTg I D) 

Posterior odds 


p{D\Ta) 

Bayes factor 


PjTg) 

Prior odds 


(4) 


This trivializes model comparison — we simply favour the model that is most probable. 

The fact that we consider a ratio is important: it eliminates an unknown normalization 
constant in Eq. (3). If we were to consider a single theory, the posterior and prior probabilities 
must equal unity, that is, certainty. We might instead consider the Bayesian evidence for a 
single theory, but this is not the quantity of interest — it is the probability of obtaining the 
observed data. This point is stressed by Jaynes [49]; 


.. .it is meaningless to ask how much those facts [the data] tend “in themselves” 
to confirm or refute Hq [a hypothesis]. Not only the mathematics, but also our 
innate common sense (if we think about it for a moment) tell us that we have not 
asked any definite, well-posed question until we specify the possible alternatives 
to Hq ... mere improbability, however great, cannot in itself be the reason for 
doubting Hq. 


Furthermore, if the evidence is a probability density as function of the data, it is a dimensionful 
number that depends non-trivially on our choice of parametrization of the data. It makes no 
sense to ask whether such an evidence is large or small. 
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IV. CASTING THE BIG-HIERARCHY PROBLEM IN BAYESIAN LANGUAGE 


The gist of the hierarchy problem and hne-tuning arguments emerge from Bayesian 
probability — they are not ingredients or desiderata. To see that the big-hierarchy problem 
emerges from Bayesian probability, consider the Bayesian evidence (the factor p{D \ T) in 
Eq. (3)) as a function of the data, as plotted in Fig. 1. Our data is the observed weak scale, 
and we consider the SM and a natural theory in which quadratic corrections are truncated 
before the Planck scale. As a function of the data, the evidence is normalized to unity. 
Because of quadratic corrections, the SM wastes probability mass near the Planck scale, 
such that the evidence at the correct weak scale is minuscule — its generic prediction is 
that the weak scale is near the Planck scale.^ Natural theories, on the other hand, truncate 
quadratic corrections at a scale below the Planck scale. Their generic prediction for the 
weak scale is broad — the distribution spans the lowest scale to the Planck scale — but the 
probability density at the correct weak scale is signihcantly greater than that in the SM. The 
LHC constraints slightly weaken our preference for natural theories versus the SM, but the 
preference for natural theories remains colossal. For example, even in light of LHC results, 
we ought to place about thirty orders of magnitude more faith in the constrained minimal 
supersymmetric Standard Model than in the SM [55]. 

We calculated the evidences in Fig. 1 in the usual manner by integrating a likelihood 
function over a theory’s parameter space, denoted by 6, with a suitable measure (the prior 
distributions): 

pihgMz \T) = Mz j dep{e I T) X p{Mz \ 9). (5) 

For full details of the calculations in Fig. 1, see Ref. [55]. We are not suggesting that our 
priors correspond to physical probabilities with which nature picks Lagrangian parameters. 
Our priors merely reflect our ignorance about which values of the theory’s parameters would 
be realized, were the theory true. Thus, our priors would not be “wrong” if we were oblivious 
to a mechanism in nature that picks Lagrangian parameters such that there is a hierarchy 
between the weak scale and the Planck scale. If we found such a mechanism, it would be 
favoured by the Bayesian evidence and solve the hierarchy problem. 

^ I assume that the cut-off in the SM is near the Planck scale. If it were higher, the SM’s generic prediction 
for the weak scale would be even worse; if it were lower, it would imply new physics below the Planck scale. 
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Figure 1: Illustration of the evidence, interpreted as a sampling distribution. The SM squanders 
probability mass at the Planck scale. A natural theory, on the other hand, evades quadratic 
corrections. The approximately linear behavior in the natural theory results from the fact that the 
weak scale is the sum of a bare mass and a correction that could be much less than the Planck 

scale [55]. 

With Bayesian probability, the naturalness of a theory is its probability in light of data. 
Bayesian probability trivializes naturalness arguments — it is a tautology that natural 
theories are more probable than unnatural theories. This is related to Occam’s razor (see e.g., 
Ref. [56]). Bayesian probability justifies Occam’s razor and provides an automatic razor (see 
e.g., Ref. [57]) that favors simple theories. Simple theories make clear predictions such that 
their evidence is sharply peaked; whereas complicated theories make broad predictions such 
that their evidence is thinly spread. In Bayesian language, naturalness and simplicity are 
synonymous and the hierarchy problem is a misnomer; there is no problem — the “problem” 
is simply the observation that theories that generically predict the correct weak scale ought 
to be favoured.^ To worry about naturalness is to worry that there might exist a model 


^ There is confusion about whether the weak scale is predictable or calculable in the SM or in a supersymmetric 









favoured by the evidence, as in Fig. 1. 

One might wonder why the weak scale is of special concern; after all, perhaps all measured 
quantities require hne-tuning. In the SM, for example, we must tune the Yukawa couplings 
across several orders of magnitude to agree with the measured fermion masses. The answer 
is that naturalness could concern any measured quantity; the weak scale is special only 
because the SM’s prediction for the weak scale is awful compared to that in, for example, a 
supersymmetric theory. If a new theory made generic predictions for the Yukawa couplings 
that resulted in precisely correct fermion masses, it would be favoured by the Bayesian 
evidence relative to the SM. We do not know of such a theory. 

A numerical measure of hne-tuning in supersymmetric theories was developed in the 
1980s by Ellis [58] and by Barbieri and Giudice [59]. Their measure (henceforth a derivative 
measure) was based upon derivatives of the weak scale with respect to a theory’s fundamental 
parameters (denoted by 6): 

. dlnMz 

A = - 

* d\ne. 

Remarkably, a similar derivative measure results from calculations of the evidence in Bayesian 
probability (see e.g.. Ref. [55, 60-64]), validating the intuition behind derivative measures. 
The factor results from a Dirac delta function of the measured weak scale integrated with 
respect to a parameter, e.g., the supersymmetric /i-parameter. If we pick a logarithmic prior 
for the /r-parameter, we hnd that 


p{Mz \ MSSM) ^ J ^ i^z ~ ^z) p(h)d/i 

^ (7) 

J Z 1-^ 

1 

Note, however, that no derivative measures for parameters other than the chosen parameter 
(in this case the /i-parameter) emerge in the Bayesian evidence. 

By now, however, there are numerous derivative measures (see e.g.. Ref. [65]) — derivative 
measures Aj in Eq. (6) could be averaged, root-mean-square averaged or minimized, and 
evaluated at the weak-scale or the high-scale — built upon a farrago of ideas about naturalness 
theory. The situation is identical in each theory: if one specifies Lagrangian parameters and a cut-off scale, 


one can calculate the weak scale. I specify prior distributions for the Lagrangian parameters and cut-off 
scale and calculate the Bayesian evidence. By “generic prediction,” I refer, roughly, to the mode or shape 


in the Bayesian evidence as a function of the data. 
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beyond those captured in Bayesian probability. From the Bayesian point of view, those 
ideas are epistemically irrelevant and specious. This is not a problem or a failure of 
Bayesian probability to fully account for hue-tuning; our goal is not to emulate foibles of 
human reasoning about hue-tuning, but to elucidate correct reasoning. Of the miscellany 
of derivative measures, the best measures are those closest to the spirit and formalism of 
Bayesian probability. 

Ghilencea et al [66-71] claim that traditional chi-squared analyses of supersymmetric 
theories ignore a contribution to the chi-squared from hne-tuning of the electroweak scale. 
They summarize their work as showing that hne-tuning “can rule out a model without a 
detailed chi-squared analysis” in a frequentist analysis [67]. The test-statistic in Ghilencea et 
al includes the logarithm of the Barbieri-Giudice measure. In A. Ghilencea et al approximate 
the sampling distribution for the Z-boson mass with a Dirac delta function — it is not a 
random variable and would not vary between repeat experiments. Thus, the quantity In A 
is not a random variable and it makes no sense to claim it could be used as a test-statistic 
in a frequentist analysis. If the Z-boson mass were considered to be a Gaussian random 
variable, a chi-squared would be a sensible test-statistic, but this chi-squared would be zero 
because supersymmetric theories are hne-tuned to predict the measured Z-boson mass. In 
other words, contrary to the claims in Ghilencea et al, there is no hne-tuning penalty in a 
frequentist analysis. Fine-tuning penalties occur only in Bayesian probability. 


V. THE LITTLE-HIERARCHY PROBLEM: WHEN SHOULD WE WORRY ABOUT 
FINE-TUNING? 

We stressed that our degree of belief ought to be relative, and that, in Bayesian language, 
to worry about hne-tuning is to worry that there might exist a theory favoured by the 
posterior odds. The fact is that unless a theory exactly reproduces the observed data with no 
adjustable parameters (I refer to this as a spot-on model), we might always wonder whether 
a better model exists. That is, there is no concept of “theory conhrmation” in Bayesian 
probability — our favoured theory is only the most probable theory so far, and we might 
hnd a better one. In that sense, hne-tuning problems are never resolved. There is no point 
at which we declare that we have found a natural, untuned theory (unless we hnd a spot-on 
model). 
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When, then, should we worry about fine-tuning? Were we to discover a supersymmetric 
theory near the weak scale, should we still worry that there could be a more natural 
explanation for the hierarchy between the weak scale and the Planck scale? To answer this, 
consider the big-hierarchy problem in the SM. This fine-tuning problem was immediately clear 
without comparisons to alternative models because one can immediately conceive of “natural” 
theories in which the quadratic corrections are absent. We should always attempt to find more 
natural theories, but the issue is pressing only if we can readily conceive of a more natural 
theory. This is not the case with the little-hierarchy problem (or indeed with the cosmological 
constant [72]); it is a challenge to construct a theory that is favoured by the posterior odds 
relative to natural theories that suffer from a little-hierarchy problem. Admittedly, this is 
somewhat oxymoronic — we should only worry about fine-tuning problems if we can envisage 
solutions. The point is, however, that fine-tuning problems are not problems; they are simply 
the observation that a more probable theory might exist. If we struggle to construct such 
a theory, there is no immediate problem, though if in the future we find one, it would be 
favoured. 

If we discovered evidence for a natural theory slightly above the weak scale, at 10 TeV, for 
example, there might be puzzlement about why it was at 10 TeV rather than 1 TeV; certainly 
some might ask, why is there a little-hierarchy? Why is nature described by a natural theory 
at 10 TeV rather than a natural theory at 1 TeV? After all, would not a natural theory 
at 1 TeV be most natural? Our answer to the latter question is negative — naturalness is 
nothing but Bayesian evidence. The evidence for a theory that predicts new physics at 1 TeV 
would be minuscule if we saw new physics only at 10 TeV. The former question, on the other 
hand, is purely metaphysical; it cannot be posed in Bayesian language. 


VI. THE STANDARD MODEL IS NOT AN EFFECTIVE THEORY 

Let us turn to the first response to the little-hierarchy problem outlined in the introduction — 
that the SM is not an effective theory. In this case, presumably, finite renormalized parameters 
in the SM, in a particular renormalization scheme, are fundamental parameters. This is at 
odds with the modern understanding of renormalization; in this approach, renormalization is 
viewed as it was prior to Wilson’s insights [73] — an algorithm for “sweeping the infinities 
under the rug.” What are sensible priors for the theory’s parameters? What, then, is the 
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generic prediction of snch a model for the weak scale? This is problematic; we claim that 
this is a fnndamental theory, closing the door on any mechanism that conld determine the 
parameters, indicate that dimensionless parameters might be close to one, or bonnd the 
parameters. The priors for the parameters in snch a theory and its prediction for the weak 
scale are ill-dehned improper distribntions — inhnitesimal over an inhnite range.^ This 
prediction for the weak scale is much worse than that of the SM in Fig. 1. Admittedly, such 
a model may replicate our observed data, but even the SM is a more probable explanation 
of that data. If the SM is interpreted as an effective theory, it is reasonable that our prior 
distributions are proper distributions, that is, that our priors reach zero asymptotically, 
because a mechanism in the ultra-violet theory could determine or bound the parameters in 
the SM. This disfavours scales that far exceed the cut-off scale or couplings that far exceed 
unity. 


VII. TECHNICAL NATURALNESS 

Before summarizing, I briefly turn to “technical naturalness” — a notion of naturalness 
related to big and small numbers espoused by Dirac in the 1930s [74]. Dirac frowned upon 
any big or small fundamental parameters in a theory. From a Bayesian perspective, Dirac’s 
preference for numbers of order unity might be dismissed as an eccentric, subjective prior. 
Our priors ought to reflect our state of knowledge; we must ask ourselves whether there is 
sufficient reason for believing that small numbers are improbable in nature. It is plausible, 
but hardly compelling and possibly circular, that if our theory’s parameters were exact 
solutions of an unknown fundamental theory, exact solutions of order one might be more 
probable. 

Dirac’s idea was modernized by’t Hooft in the 1970s [75]. Unlike Dirac, ’t Hooft tolerated 
small numbers, but only if they were connected to a symmetry: 

at any scale ja, a physical parameter or set of physical parameters is allowed 
to be very small only if the replacement ai{p) = 0 would increase the symmetry 
of the system. 

^ This argument fails for quantum field theories with no adjustable parameters or a single adjustable 
parameter, such as QCD, because a single parameter is an arbitrary definition of a unit of mass. 
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’t Hooft’s modification could be justified by Bayesian probability. Because such a small 
parameter could originate from the microscopic details of a mechanism of symmetry breaking, 
we might find it, a priori, more plausible that it is small. 

In Sec. IV, naturalness was favoured in the evidence by a combination of prior beliefs 
and experimental data in the likelihood. In contrast, technical naturalness only concerns 
appropriate prior beliefs in a theory’s parameters. Dirac’s and’t Hooft’s ideas about technical 
naturalness hinge upon whether it is appropriate to penalize big and small numbers in our 
priors. Although such penalties could be justified in particular theories, there is no reason 
why we ought to pick priors that favour technical naturalness in all theories in high-energy 
physics. 


VIII. SUMMARY AND DISCUSSION 

In light of data from LEP and the LHC, many are questioning the principle of naturalness 
and the signihcance of the hierarchy problem — perhaps naturalness was an ill-dehned 
prejudiced basis for scientific knowledge? We have argued, however, that naturalness is not 
an aesthetic principle. Naturalness is grounded in Bayesian probability, which is arguably the 
unique framework for updating our opinions in light of experimental data, that is, the unique 
framework for scientihc inference. Because Bayesian probability is a mathematical framework 
rather than a scientihc theory, it makes no sense to argue that it has been “falsihed” because 
we observed “improbable” data. As a helpful analogy, suppose I claimed to have falsihed 
probability theory because I observed a coin land on heads 100 times in a row, which I 
deemed improbable. Besides my reasoning being somewhat circular, my observations could 
only discriminate between rival explanations for the behaviour of the coin. I might infer that 
the coin is biased, but I cannot conclude that I must reject probability theory. On the other 
hand, we found limited support for technical naturalness. 

The big-hierarchy problem is not in itself a problem; it is merely the fact that compared 
with supersymmetric or technicolor theories, the SM makes an awful prediction for the weak 
scale. The big-hierarchy problem in the SM is immediately clear without comparisons to 
alternative models because one can immediately conceive of “natural” theories in which the 
quadratic corrections are absent. In light of Mz/Mp < 10“^®, we should doubt the SM. 
In contrast, there is no reason to doubt supersymmetric or natural theories because of the 
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little-hierarchy problem — there is no alternative explanation under which the data is more 
probable. We cannot consider whether a supersymmetric or natural theory is probable — we 
must consider whether it is more or less probable than alternative explanations, such as the 
SM. 

We found that, from a Bayesian perspective, there is no support for the argument that 
the SM is a fundamental theory. This exacerbates hne-tuning problems because whereas the 
SM makes an awful generic prediction that the weak scale is close to the Planck scale, such a 
theory makes no generic predictions for the weak scale. 

In summary, the little-hierarchy problem is a little problem; we cannot reject the concept of 
naturalness because of the absence of new physics near the weak scale. This faulty reasoning 
stems from a misunderstanding of the foundations of naturalness arguments. As frustrating 
as it is, until we hnd it or reach the highest energy scales, the most probable theories, such 
as supersymmetric theories, predict new physics that is always just around the corner and 
we should remain positive about the prospects for new physics in the second run of the LHC. 
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